Intoxicated Speech Detection using MFCC Feature Extraction and Vector Quantization

نویسندگان

Risha Mal

R. K. Sharma

Naveen Kumar

چکیده

This study has been done on a technique which is suitable for tapping the telephonic conversation from a remote location to identify intoxication and consequent impaired brain activity that may cause criminal events e.g. DUI (driving under influence). This technique is time efficient, easy to use, non–invasive for the peoples and affordable for law enforcement personnel, bartenders/servers, court of law, coworkers/supervisors, clinicians, teachers and individuals who need to identify the presence and level of intoxication state in other peoples. The peaks in log Mel Filter Bank are main cues for identifying the sounds of speech. If a person is found drunk and his/her voice shows a great deal of variation, then this study describes an effective unsupervised method for query-by-audio sample speaker retrieval firstly by extracting MFCC features and then VQ (vector quantization) algorithms on the alcoholic audios. This method is also supported by verifying some speech parameters (fundamental frequency, jitter, shimmer). A set of twelve mel-frequency cepstrum coefficients computed every 10ms and which resulted the best performance i.e. 95% recognition with each of 8 speakers. The superior performance of the mel-frequency cepstrum coefficients may be an attributed to the fact that they better represent the perceptually relevant aspects of the short-terms speech spectrum.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Automatic Speaker Recognition Using Fuzzy Vector Quantization

Speaker recognition (SR) is a dynamic biometric task. SR is a multidisplinary problem that encompasses many aspects of human speech, including speech recognition, language recognition, and speech accents. This technique makes it possible to use the speaker’s voice to verify his/her identity and provide controlled access to services. The Mel-frequency extraction method is leading approach for sp...

متن کامل

Speaker Dependent Word Recognition Using MFCC and VQ

The paper present effective method for recognition of digit, numbers. Most of speech recognition systems contain two main modules as follow “feature extraction” and “feature matching”. In this project, (MFCC) Mel Frequency Cepstrum coefficient algorithm is used to simulate feature extraction module. Using this algorithm, the Cepstral Coefficients are calculated on Mel frequency scale. VQ (vecto...

متن کامل

Comparative Study of MFCC And LPC Algorithms for Gujrati Isolated Word Recognition

The study performs feature extraction for isolated word recognition using Mel-Frequency Cepstral Coefficient (MFCC) for Gujarati language. It explains feature extraction methods MFCC and Linear Predictive Coding (LPC) in brief. The paper compares the performances of MFCC and LPC features under Vector Quantization (VQ) method. The dataset comprising of males and females voices were trained and t...

متن کامل

Automatic Speaker Recognition using LPCC and MFCC

A person's voice contains various parameters that convey information such as emotion, gender, attitude, health and identity. This report talks about speaker recognition which deals with the subject of identifying a person based on their unique voiceprint present in their speech data. Pre-processing of the speech signal is performed before voice feature extraction. This process ensures the voice...

متن کامل

Spoken Language Identification Using Hybrid Feature Extraction Methods

This paper introduces and motivates the use of hybrid robust feature extraction technique for spoken language identification (LID) sys tem. The speech recognizers use a parametric form of a signal to get the most important distinguishable features of speech signal for recognition task. In this paper Mel-frequency cepstral coefficients (MFCC), Perceptual linear prediction coefficients (PLP) alon...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2014

Intoxicated Speech Detection using MFCC Feature Extraction and Vector Quantization

نویسندگان

چکیده

منابع مشابه

Automatic Speaker Recognition Using Fuzzy Vector Quantization

Speaker Dependent Word Recognition Using MFCC and VQ

Comparative Study of MFCC And LPC Algorithms for Gujrati Isolated Word Recognition

Automatic Speaker Recognition using LPCC and MFCC

Spoken Language Identification Using Hybrid Feature Extraction Methods

عنوان ژورنال:

اشتراک گذاری